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1. 


Introduction 


In  this  report*  we  discuss  relational  programming,  i.e.  a 
style  of  programming  in  which  entire  relations  are  manipulated 
rather  than  individual  data.  This  is  analogous  to  functional 
programming  [1],  wherein  entire  functions  are  the  values  manipu¬ 
lated  by  the  operators.  We  will  see  that  relational  programming 
subsumes  functional  programming  because  every  function  is  also  a 
relation.  It  is  appropriate  at  this  point  to  discuss  why  we  have 
chosen  to  investigate  relational  programming.  The  reader  can 
find  a  shorter  introduction  to  relational  programming  in  [12]. 

As  we  have  noted,  relational  programming  subsumes  functional 
programming;  hence,  anything  that  can  be  done  with  functional 
programming  can  be  done  with  relational  programming.  Further¬ 
more,  relational  programming  has  many  of  the  advantages  of  func¬ 
tional  programming:  for  instance,  the  ability  to  derive  and 
manipulate  programs  by  algebraic  manipulation.  A  well  developed 
algebra  of  relations  dates  back  to  Boole's  original  work  and  has 
been  extensively  studied  since  then.  Although  relations  are  more 
general  than  functions,  their  laws  are  often  simpler.  For 
instance,  (fg)-1  *  g~1f“1  is  true  for  all  relations,  but  true 
only  for  functions  that  are  one-to-one.  Also,  relational  pro¬ 
gramming  more  directly  supports  non-linear  data  structures,  such 
as  trees  and  graphs,,  than  does  functional  programming.  In  rela¬ 
tional  programming  the  basic  data  values  are  themselves  rela¬ 
tions,  whereas  in  functional  programming  there  is  a  separate 
class  of  objects  (lists)  used  for  data  structures.  One  final 
reason  for  investigating  relational  programming  is  that  it  pro¬ 
vides  a  possible  paradigm  for  utilizing  associative  and  active 
memories.  As  a  teaser  for  what  is  to  come,  we  present  the  fol¬ 
lowing  example  of  a  relational  program.  We  will  take  a  text  T, 
represented  as  an  array  of  words  (i.e.,  T:i  is  the  i-th  word), 
and  generate  a  frequency  table  F  so  that  F:w  is  the  number^of 
occurences  of  word  w  in  T.  Now  we  will  see  (section  4)  that  T:w 
is  the  set  of  all  indices  of  the  word  w.  If  we  let  #:C  be  the 
cardinality  of  a^class,  then  the  number  of  indices^  (occurences) 
of  w  is  just  #:(T:w).  Therefore  we  can  write  F  »  #T  (Section  7). 


2.  Classes  and  Relations 
2.1  basic  concepts 


Our  relational  calculus  will  deal  with  three  sorts  of 
things:  individuals,  classes  and  relations.  These  can  best  be 

illustrated  by  example.  If  'x'  is  the  name  of  an  individual  and 
'C'  is  the  name  of  a  class,  then  ' x9C '  means  that  the  individual 
denoted  by  'x'  is  a  member  of  the  class  denoted  by  'C*  (i.e., 

*  The  work  reported  herein  was  supported  by  the  Foundation 
Research  Program  of  the  Naval  Postgraduate  School  with  funds 
provided  by  the  Chief  of  Naval  Research. 
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that  x  has  property  C) .  Thus  'Ar istotleGman '  would  indicate  that 
Aristotle  is  a  man,  and  ' 2Geven '  would  mean  that  2  is  an  even 
number.  (The  symbol  'G'  is  an  abbreviation  for  Gctri ,  which  is 
the  Greek  word  for  'is'.) 

If  'x'  and  'y'  are  names  of  individuals  and  'R'  is  the  name 
of  a  relation,  then  'x  R  y*  means  that  x  bears  the  relation  R  to 
y.  For  example. 


Aristotle  student  Plato 

means  that  Aristotle  is  a  student  of  Plato.  Also,  '2  <  3'  means 
that  2  bears  the  less-than  relation  to  3,  i.e.,  that  2<3.  Where 
there  is  little  chance  of  confusion,  'x  R  y'  will  be  written 
'xRy'  and  'xGP'  will  be  written  'xP'.  The  notation  that  we  have 
introduced  above  will  be  extended  to  classes  of  classes,  classes 
of  relations,  relations  among  classes,  relations  among  relations, 
etc . 


2. 2  relational  descriptions 

If  'S(x)'  is  a  sentence  involving  'x',  then  a  class  descrip¬ 
tion  is  an  expression  of  the  form  'x(S(x))'.  This  denotes  the 
class  of  all  individuals,  a,  for  which  S(a)  is  true,  i.e., 

a  G  x(S(x)  )  S  (a) 

Similarly,  if  'S(x,y)'  is  a  sentence  involving  ’x’  and  'y',  then 
*x9(S(x,y))'  is  a  relation  description  which  holds  between  a  and 
b  whenever  S(a,b)  is  true,  i.e., 

a  [  x? (S  (x,y) )  ]  b  S (a ,b) 

To  illustrate  this  notation  we  will  define  the  converse  of  a 
relation. 

2. 3  converse 

The  relation  R-^-  is  called  the  converse  of  R,  i.e. 
xR-1y  yRx.  Using  our  notation  for  descriptions  we  can 

define , 


r“^  =  x9(yRx) 

As  an  example  of  a  relation  among  relations,  we  define  as  the 

relation  that  holds  between  converses: 

r  '  s  r»s”^ 


Hence 


? § ( r*s“* ) 


Some  examples  of  converses  are: 

parent-^-  =  child 
<~1  =  > 

The  following  are  easily  proved  properties  of  the  converse: 


r ' s  s  '  r 

i  ~  J.  _  i 

2. 4  arrow  diagrams 

Relations  can  be  portrayed  by  "arrow  diagrams"  (Haase 
diagrams) .  In  such  a  diagram  there  is  a  node  for  each  individual 
related  by  the  relation  and  an  arrow  from  x  to  y  whenever  xRy. 


represents  the  relation  R  such  that 

bRa,  cRb,  dRb,  eRd,  eRe,  bRe 


and  ~xRy  for  all  other  cases.  The  effect  of  the 

tor  is  to  reverse  all  of  the  arrows.  Hence,  R-* 

'C 


converse  opera- 
is  diagrammed: 


2.5  tables 


Relations  can  often  be  viewed 
relation  R  of  the  previous  section 


as  tables.  For  instance, 
can  be  shown  as  a  table: 


the 


R 


b 

a 

c 

ns 

d 

b 

e 

d 

e 

e 

b 

e 

Of  course,  it  makes  no  difference  in  what  order  we  write  the  rows 
of  the  table. 
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The  converse  of  a  relation  is  obtained  by  simply  exchanging 
the  columns  of  the  table: 


Of  course,  classes  are  represented  by  one  column  tables.  For 
instance  the  class  C  of  primes  less  than  ten  is: 


C 


2 

3 

5 

7 


3 .  Domains 

We  often  need  to  talk  of  the  individuals  that  can  occur  on 
the  right  or  left  of  a  relation.  We  say  that  x  is  a  left-member 
of  R  whenever  there  is  a  y  such  that  xRy. 

x  Lm  R  3-y(xRy) 

For  instance,  if  'x  parent  y'  means  that  x  is  a  parent  of  y,  then 
'Socrates  Lm  parent'  means  that  Socrates  is  a  parent.  Right- 
member  and  member  are  defined  analogously: 

y  Rm  R  3x(xRy) 

z  Mm  R  <->  z  Lm  R  V  z  Rm  R 

These  satisfy  the  identities: 

x  Lm  R  x  Rm  R~j 

y  Rm  R  <->  y  Lm  R_i 


4.  Functions 


4. 1  basic  concepts 

Functions  and  relations  are  closely  related.  Consider  the 
predecessor  relation,  'pred': 


x  pred  y  x  =  y-1 


-  5  - 


Thus,  x  pred  y  says  that  x  is  the  predecessor  of  y. 
corresponding  arrow  diagram  is: 

1  2  3  4  5 


and  the  corresponding  table  is: 


since  1  pred  2,  2  pred  3,  etc.  Notice  that,  in  this  case,  for 
each  right  member  x  there  is  a  unique  left  member  y  such  that 
y  pred  x.  This  y  can  be  written  using  Whitehead  and  Russell's 
[16]  definite  description : 

7 y (y  pred  x) 

This  can  be  read:  the  y  such  that  y  is  a  predecessor  of  x.  A 
more  convenient  way  to  write  this  is: 

pr ed :  x 

In  general,  R:x  means  "the  unique  y  such  that  y  R  x,  i.e. 

R:x  =  7y(yRx) 

This  notation  is  meaningful  only  if  there  is  a  unique  y  such  that 
yRx ,  i.e. 

yRx  A  zRx  ==>  y=z 

That  is,  there  is  only  one  arrow  leading  to  x.  When  this  condi¬ 
tion  is  satisfied  for  all  x  we  call  R  left  univalent,  symbolized 
by  1 lun ' : 

RSlun  Vxyz[  yRx  A  zRx  =»  y=z  ] 

The  left  univalent  relations  are  more  commonly  called  functions . 
In  a  left  univalent  relation  there  is  exactly  one  arrow  leading 
to  each  node.  Consider  the  "absolute  reciprocal"  relation: 

xRy  <->  x  =  | 1/y I 


This  is  diagrammed: 


I 


1 /£ 


1 Is 


Since  RGlun  it  is  meaningful  to  write  R:x,  so  we  observe  R:(-3)  = 
1/3.  We  can  find  R:x  by  following  back  the  arrow  pointing  to  x 
or  by  looking  down  the  right  column  for  x  and  taking  the 
corresponding  element  from  the  left  column. 

The  concepts  of  right  univalence  and  bi-univalence  are 
defined  analogously: 

RGrun  <-•»  Vxyz  [  xRy  A  xRz  =»►  y=z  ] 

RGbun  4->  RGlun  A  RGrun 

Bi-univalent  relations  are  also  called  bijections  and  one-one 
mappings . 

4 . 2  higher  level  functions 

Of  course,  the  converse  of  a  function  is  not  necessarily  a 
function.  The  'sin'  relation,  defined  so  that  y  sin  x  means  that 
y  is  the  sine  of  x,  is  left  univalent  but  not  right  univalent. 
Hence,  we  can  write  either  y=sin:x  or  y  sin  x,  but  can  express 
the  arcsine  only  by: 

x  sin-1  y 

The  notation  sin-1:y  is  meaningless.  Since  f:x  is  meaningful 
only  when  fGlun  we  will  be  careful  to  write  f:x  only  when  we  have 
previously  shown  (or  it  is  obvious)  that  fGlun  and  x  Rm  f. 

The  fact  that  f:x  may  be  meaningless  makes  it  convenient  to 
use  several  other  relations  derived  from  f.  One  of  these  is  the 
plural  description .  If  F  is  any  relation  and  C  is  a  class  then 
F! :C  is  the  set  of  all  y  such  that  yFx  for  some  x  in  C,  i.e., 

F!  =  l^[z  =  ?[3-x(yFx  A  xGC)  ]  } 

The  tabular  interpretation  of  F! :C  is  simple: 


1 

1 

1 

-1 

1/2 

2 

1/2 

-2 

1/3 

3 

1/3 

-3 

: 

• 

J 
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We  see  that,  if  F  is  any  function,  then  F!:S  is  the  image  of  the 
class  S  under  that  function.  Notice  that  the  operation  F!:S  is 
defined  for  all  relations  F  and  classes  S,  regardless  of  whether 
F€lun  or  the  members  of  S  are  right  members  of  F.  For  these  rea¬ 
sons,  it  is  generally  safer  to  write  F! :C  than  F:x. 

Related  ideas  are  the  image  and^converse  image  of  an  indivi¬ 
dual.  If  R  is  a  relation,  then  c  R  x  means  that  c  is  the  class 
of  individuals  related  to  x.  This  class  is  called  the  referents 
of  x,  and  is  defined: 

if:  x  =  £(yRx) 

The  converse  idea  is  that  of  the  relata  of  y: 

R:y  =  x(yRx) 

4. 

Like  the  plural  description,  R  and  R  are  defined  for  all  R  and 
all  arguments. 

Consider  now  the  function 

=:x  =  ?(y*x) 

Hence,  =:y  is  just  the  unit  class  containing  y.  Russell  and 
Whitehead  [16]  write  thTSTT:y.  Conversely,  if  C  is  a  single  ele¬ 
ment  class,  then  (=-1):C  selects  the  unique  member  of  that  class: 

(=-1):C  *  7x(x6C) 

It  is  thus  a  uniqueness  filter.  We  will  write  this  as  ©:C: 


The  expression  9:C  can  be  read  "the  C" . 

We  will  occasionally  need  to  refer  to  the  relations  that 
hold  between  R  and  R  or  R  and  R\  which  we  write  -  and  respec¬ 
tively: 

R  ^  S  <->  R  =  Jf 
R  -  S  <-»  R  =  S’ 

properties  of  these  operations: 


to  have  names  for  domain  extracting  func- 
the  class  of  left  members  of  R.  These  are 


The  following  are  some 


3 

R 

!?:y 
Tf  = 
It  = 


=  R 
=  R 
*'■ 
-:  R 
-:  R 


(=:y) 


It  is  often  convenient 
tions,  e.g.,  lem:R  is 


J 
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simply  defined  using  images: 

1 em  =  Em 
rim  =  Km 
mem  =  Mm 

Of  course  the  right  and  left  members  of  a  relation  can  be 
obtained  by  taking  its  right  and  left  columns,  respectively,  and 
deleting  duplicates. 


R 


We  will  next  investigate  ways  of  combining  relations  and 
classes.  The  simplest  methods  are  just  abstractions  of  the  logi¬ 
cal  connectives  used  between  propositions:  Therefore,  we  define 
the  intersection,  union,  negation  and  difference  of  classes  and 
relations : 

x(S  A  T)  xS  A  xT 

x(R  A  S)y  «-»  xRy  A  xSy 

x (S  V  T)  «-»  xS  V  xT 
x(R  V  S)y  «-»  xRy  V  xSy 

x ( -S )  -(xS) 

x(-R)y  -(xRy) 

x (S-T)  «-»  xS  A  ~(xT) 

x (R-S ) y  «-»  xRyA-(xSy) 

x(S-»T)  <-»  xS  -»  xT 

x(R-»S)y  4-»  xRy  -*  xSy 

As  an  example  of  the  use  of  these  operations,  consider  our  previ¬ 
ous  definition  of  Mm: 


z  Mm  R  z  Lm  R  V  z  Rm  R 


TS"  *Uf*'*K. 


-  9  - 


Using  the  union  operation  this  can  be  written: 

Mm  -  Lin  V  Rm 


Similarly, 


bun  =  lun  A  run 

The  logical  connectives  satisfy  the  usual  properties  of  a  Boolean 
algebra  (e.g.,  DeMorgan's  theorem). 

As  an  example  of  the  use  of  these  operations,  we  will  define 
the  closed  interval  function,  m..n,  which  is  the  set  of  integers 
m,  m+1,  ...,  n.  It  is  just: 

m .  .n  =  >^:m  A  <_: n 

where  _>  and  <  are  the  relations  on  integers. 

5. 2  empty  classes  and  relations 

It  is  useful  to  have  names  for  the  empty  class  and  relation: 


$  =  S(x^x) 

9  =  x?(x/x) 

Hence,  x$  is  always  false,  as  is  x©y.  These  are  most  often  used 
for  stating  properties  of  relations  and  classes.  For  instance, 

SAT  =  $ 

means  that  classes  S  and  T  have  no  members  in  common. 

The  universal  classes  and  relations  are  also  useful: 

¥  «  -$ 

©  =  -© 

For  instance, 

S  V  T  =  I 


means  that  every  individual  is  either  a  member 
Notice  that  the  class  of  the  right  members  of 
the  image  of  the  universe  under  that  relation. 


of  S  or 
a  relation 
i  .e  . , 


of  T. 
is  just 


rim:R  =  R!  :$ 
lem:R  *  (R-*) ! :} 

mem:R  =  (R  V  R“*) ! :$ 


It  is  often  useful  to  have  the  maximum  relation  that  can 
hold  between  two  classes,  i.e.,  the  Cartesian  product  of  those 
classes.  This  is  defined: 

S|T  =  x?(xS  A  yT) 

The  Cartesian  product  satisfies  the  following  properties: 

(sit)”1  =  t|s 

lem:  (s|t)  =  s 

rim:  (s|t)  =  t 

mem: (s|t)  =  s  V  t 

s|(t  A  u)  »  (s|t)  A  (s|u) 

s|(t  V  u)  =  (s|t)  V  ( s | u ) 

sl(t-u)  »  (sit)  A  (s|-u) 
s|(tH>u)  =  (s|-t)  V  (slu) 

©  =  111 

s|$  =  $|s_  =  $1$  =  © 

Sit  =  (s|$)  A  (lit) 

5. 4  subset  relation 

Finally,  we  define  the  subclass  and  subrelation  operations: 

SCT  -«->  Vx(xS  xT) 

RCS  Vxy  ( xRy  xSy) 

The  following  are  true: 

sCt  — >  (s|u)C(t|u) 

set  (rls)C(rlt) 

sCt  A  uCv  (s|u)C(t|v) 

rGRel  -»  rC© 

rGRel  — >  ©Cr 

sGCls  — »  sC| 

sGCls  — >  $Cs 

where  Cls  is  the  class  of  all  classes  and  Rel  is  the  class  of  all 
relations  (we  are  ignoring  typing  here).  These  can  be  defined: 

Rel  =  £[:  §  =  © 

Cls  3  C:$  =  C:$ 


6.  Limiting  and  Restriction 

It  is  often  useful  to  limit  the  left  or  right  domain  of  a 
relation.  Consider  the  relation  x  sin”1  y,  which  means  that  x  is 
an  arcsine  of  y.  We  cannot  write  x  =  sin-1:y  because  sin”1  is 
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not  left  univalent  (i.e.  it  is  not  a  function).  If  we  restrict 
y,  the  argument  of  sin,  to  the  range  —it/ 4  to  ir/4,  then  there  is  a 
unique  x  such  that  x  sin~*  y.  Let  S  be  the  class  of  reals  in  the 
range  — w/4  to  ir/4: 

xS  (-ff/4£x)  A  (xOr/4) 

then  we  will  write 

sin>S 

for  the  sine  function  with  its  arguments  restricted  to  S.  This 
function  is  bi-univalent,  so  it  is  invertible.  If  we  call  the 
inverse  of  this  restricted  sine  Arcsin: 

Arcsin  =  (sin>S)“^ 

then  it  is  perfectly  meaningful  to  write  Arcsin:x  (if  x  Lm  sin). 
The  right-restriction  operation  is  defined: 

x (R>S ) y  <-»  xRy  A  yS 

The  left-restriction  is  defined  analogously: 

x(S<R)y  xS  A  xRy 

These  notations  can  be  combined  to  restrict  both  domains: 

x  (S<R>T)  y  xS  A  xRy  A  yT 

The  combination  s<R>s  is  so  common  that  a  special  notation  is 
provided  for  it: 

RXs  =  s<R}s 

For  instance,  <XP,  where  xP  x>0,  is  the  less-than  relation 

restricted  to  positive  numbers. 

The  restriction  operations  are  easily  defined  in  terms  of 
intersection  and  Cartesian  product: 

s<r>t  =  r  A  (sit) 
rXs  =  r  A  (sis) 
s<r  =  r  A  (si|) 
r>s  =  r  A  (lis) 


Other  properties  satisfied  by  these  operations  are: 


sit  =  s|§}t 

lem; (s<r)  * 

rim: (r>s)  * 

lem: (r>s)  * 

rim: (sir)  * 

s  A  lem:r 
s  A  rim:r 
r !  :  s 
r'1 !  :s 

(sir)'1  « 

( r'1 )  >s 

(slrjtl-1  = 

(rSs)'1  = 

tl ( r'1 ) }s 
(r“1)»s 

r  >s  =  ?>s 

sir  *  sir 

r}s  A  r}t  » 

r>  (s  A  t) 

r>s_V  r>t  * 

r>  (s  V  t) 

(r|$)>s  = 

r|s 

7.  Relative  Product 


If  xRy  is  the  relation  "x  is  a  son  of  y"  and  xSy  is  the 
relation  "x  is  a  brother  of  y",  then  the  relative  product ,  R|S, 
is  the  relation  "x  is  a  son  of  a  brother  of  y."  More  formally, 

R I S  -  22{ly(xRy  A  ySz)  } 

Where  there  is  little  chance  of  confusion,  we  will  write  RS  for 
R I S .  If  f  and  g  are  functions  it  is  easy  to  see  that  f|g  is  the 
composition  of  these  functions: 

x  *  fg:z 
4-*  x  fg  z 

!y(x  f  yAy  g  z) 
ly(x=»f  :y  A  y*g:z) 

4-»  x  *  f : (g:z) 

Hence,  fg:x  *  f:(g:x). 

It  is  convenient  to  have  a  notation  for  relative  products  of 
a  relation  with  itself.  For  instance,  the  "grandparent"  relation 
can  be  written  parent  I  parent ,  which  we  abbreviate  parent2.  In 
general , 


=* (mem : R) 

R 

=*  (Rn)R  = 
(R")'1 


R  (Rn ) 


Some  obvious  properties  of 


the  relative  product  are: 


( rs) t  « 
r  (s  v  t) 
(r  V  s ) t 
r(s  A  t) 
(r  A  s)  t 
3-(rs) 


r  (st) 

=  rs  V  rt 
=  rt  V  st 
C  rs  A  rt 
C  rt  A  st 
3(rim:r  A 


lem:s) 


(rs)*1 
^m^n  _ 

(rm) n  « 
rmrn  C 

r  f  —  — 


*  (s'1) (r"1) 

rm+n  (m,n>0) 
rmn  (m,n>0, 
rm+n  (rSbun) 
r* 1 r  =  r° 


or  rSbun) 
(rSbun) 


lem:rs  C  lem:r 
rim:rs  C  rim:s 
Lm  =  Rm' 

Rm  *  Lm ' 

r9  *  9r  *  9 

rl  *  ir  *  c  where  I  =  x?(x=y) 


8.  Structures 


We  have  previously  seen  the  use  of  arrow  diagrams 


represents  the  relation  R 
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R 


a 

g 

b- 

f 

c~ 

e 

d r 

~ d 

d 

e 

e 

r 

f 

~ 

£~ 

r 

iZ 

g 

~h 

8.1  initial  and  terminal  members 


Now,  notice  that  the  left  and  right  members  of  R  are: 

lem : R  =  {a,  b,  c,  d,  e,  f,g} 
rim:R  =  {g,  f,  e,  d,  i,h} 

We  define  the  initial  members  of  R  to  be  those  members  which  are 
not  pointed  at  by  an  arrow.  Therefore,  the  initial  members  of  R 
are  the  left  members  that  are  not  right  members. 

init:R  =  (Lm-Rm) : R  =  {a,  b,  c} 

The  terminal  members  of  a  relation  are  defined  analogously: 

term:R  =  (Rm-LirT)  :  R  =  {h,  i} 

When  a  relation  is  used  to  represent  a  data  structure,  the  above 
functions  become  important. 

For  instance,  a  sequence  is  represented  by  a  relation  with 
the  structure: 


In  this  case  init:S  is  the  unit  class  containing  the  head  (first 
element)  of  the  relation  (i.e.,  a^)  and  term:S  is  the  unit  class 
containing  the  last  element  of  the  sequence  (i.e.,  an) .  Simi¬ 
larly,  S}(-init:S)  is  the  sequence  with  its  first  element 
deleted : 


a  2 


a  3 


an-l  an 


Hence,  the  following  common  sequence  manipulation  functions  can 
be  defined  (represented  by  lower  and  upper  case  alphas  and  ome¬ 
gas)  : 
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oc:  S 

=  9  init:  S 

"first" 

lu:  S 

=  9  term:  S 

"last" 

a:S 

=  S> (-init : S) 

"final" 

A:  S 

=  (-term:S)<S 

"initial 

The  following  properties  of  these  relations  are  easy  to  show: 

oc  =  uu' 
m  =  oc' 

A '  =  'a 

Ql'  =  'A 


More  operations  on  sequences  are  discussed  in  the  next  section. 


As  another  example  of  the  use  of  'init*  and  'term',  consider 
a  relation  representing  a  tree: 

a 


T  = 


Then,  9  init:  T  is  'a',  the  root  of  the  tree,  and  term:T  is  {d, 
h,  i,  f,  j,  k},  the  leaves  of  the  tree.  The  result  is  analogous 
for  forests.  Given 


F  = 


v  w 
u 


the  set  of  roots  is  init:F  and  the  set  of  leaves  is  term:F: 
init:F  =  {a,i,g} 

term:F  =  {c , e , f ,g ,h , j , k , 1 ,m , n , t , u , v , w} 

8. 2  higher  level  operations 

The  set  of  nodes  whose  parent  is  n  is  just  F_1:n.  For 
instance,  the  set  of  nodes  directly  descended  from  a  root  is 

F_1! : (init:F)  ■  { b ,h , j ,o , p , r } 

The  set  of  nodes  that  point  to  leaves  is 


F l : (term:F) 


{b,d ,a , i ,o ,p,s  ,r } 


V  *; 


fill!"!. I  JJ .  ji  RfPIQUJ 
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These  operations  can  be  used  for  obtaining  the  maximum  and 
minimum  of  sets.  Suppose  '<’  is  the  less-than  relation  on 
integers  and  S  is  some  set  of  integers,  say  {3,5,9}.  Then 

<»S  = 

3  5  9 

Now  note  that 

init:  (<5S)  =  {3} 

term :  (CSS )  =  { 9 } 

Hence,  if  S  is  any  set  of  numbers,  then  the  minimum  and  maximum 
of  this  set  are: 


min  :S  =  oc:  (CSS) 
max  :  S  =  uu:  (CSS ) 

These  operations  are  only  defined  if  S  has  two  or  more  elements, 
since  an  irreflexive  relation  cannot  relate  less  than  two  ele¬ 
ments.  That  is,  an  irreflexive  relation  when  restricted  to  a 
unit  or  empty  class  becomes  the  empty  relation.  Notice  that  we 
can  select  the  maximum  and  minimum  based  on  any  relation  that  is 
a  series  (i.e.,  transitive,  irreflexive  and  connected).  If  R  is 
any  series  then  oc: (RsS)  is  the  minimum  (relative  to  R)  and 
ui:  (RSS )  is  the  maximum. 

The  following  are  simple  properties  of  these  operations: 

ini t : r  *  term :  (r“| ) 
term:r  =  init:  (r_1) 
init  C  lem 
term  C  rim 

init:(rXs)  *  term: (r-1Xs) 


init : (r  V  s)  C 
init:r  A  init:s 
term:  (r  V  s)  C 
term : r  A  term :s 
init: (sit)  * 
term:  (sit)  = 


init:r  V  in.it:s 
C  init:  (r  A  s) 
term:r  V  term:s 
C  term:  (r  A  s) 

ini t : (s|t)  1  = 


init: (t|s) 


t-s 


9.  Sequences 


9 . 1  ordinal  couples 

In  this  section  we  will  continue  the  discussion  of  sequences 
begun  in  the  last  section.  We  saw  that  it  was  easy  to  define  the 
following  operations  on  sequences: 
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oc:S  =  9  init:S 

uu:S  =  9  termsS 

.ft: S  =  (-init  :S)  <S 
A:S  =  S}  ( -  term:  S ) 

This  provides  us  with  functions  for  taking  sequences  apart.  We 
will  define  the  ordinal  couple  or  pair ,  which  puts  them  together. 
If  x  and  y  are  two  objects,  then  'x,y'  is  the  relation  that 
relates  x  and  y  but  no  other  objects. 

(x,y)  =  ►  ~  > 

x  y 

That  is,  u(x,y)v  if  and  only  if  u=x  and  y=v.  This  is  formally 
defined  by: 

x  ,y  =  uv(u=x  A  v=y) 

This  notation  will  be  taken  to  be  right  associative,  i.e., 

x,y,z  =  x,(y,z) 


Observe  that 


oc:(x,y)  =  x 
ui:(x,y)  =  y 

It  will  occasionally  be  convenient  to  write  ordinal  couples  in  a 
vertical  format: 


(?) 


y)  =  (X,y) 


This  notation  is  extended  for  relations  of  more  than  one  pair 
(*l  x2  •••  x. 


xi  x 2 


V  ...  V 


The  class  of  all  the  ordinal  couples  (or  pairs)  that  can  be 
made  from  the  classes  S  and  T  is: 


SXT  =  £G-xy[xSS  A  yST  A  P=(x,y)  ]) 

There  is  obviously  a  close  relation  between  sXt  and  s<t.  Later 
we  will  say  that  s<t  is  a  Currying  of  sXt.  Note  that 

(x,y)6SXT  x  [S<T]  y 

We  will  define  a  convenient  notation  for  sequences  of  two  or 
more  elements: 

<  *1'  *2 .  xn>  a  <xi'X2)  V  (x2,x3)  V  ...  V  (xn_lfxn) 


Therefore  the  sequence  <a,b,c,d,e>  is  just 
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Also,  note  that. 


<xl'  x2' 


'  *">  =  *2  x3 


cn-l  xn 


9. 2  catenation  and  consing 

If  s  and  t  are  sequences  then  we  can  define  an  operation 
's*t' ,  which  is  the  catenation  of  s  and  t.  To  form  this  catena¬ 
tion  we  must  hook  the  last  element  of  s  to  the  first  element  of 
t: 


Therefore  x  [s^t]  y  if  and  only  if  x  s  y,  or  x  t  y,  or  x=iu:s  and 
y=oc:t.  Hence, 

s*t  =  s  V  (uu:s,  oc:t)  V  t 

The  catenation  operation  is  only  defined  for  sequences, 
which  are  required  to  have  at  least  two  elements  (since  an  irre- 
flexive  relation  with  less  than  two  elements  is  the  empty  rela¬ 
tion)  .  Note  that  we  can  extend  the  definition  of  sequences  so  as 
to  allow  length  one  sequences  by  making  the  relation  reflexive. 


s  V  (=  X  mem:s) 


— D 


The  one  element  sequence  is  then: 


s0  =  (s0'  s0) 

The  full  ramifications  of  this  definition  of  sequence  have  not 
been  investigated. 

How  do  we  add  a  single  element  to  the  left  or  right  of  a 
sequence?  The  "cons  left"  and  "cons  right"  operations  are  easy 
to  define: 

X  cl  Si  s_  X  Si  3_ 
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xcls  =  (x ,  oc:s)  V  s 
s  cr  y  =  s  V  (lu:s,  y) 

It  is  easy  to  show  that  if  s  is  a  sequence,  then: 

oc:  (x  cl  s)  =  x 
H:  (x  cl  s)  =  s 

lu:  (s  cry)  =  y 
A: (s  cry)  *  s 

(oc:s)  cl  (H:s)  =  s,  if#:s>2 
(A  :s )  cr  (iu:s)  =  s,  if  #:s  >  2 

Also,  if  s  is  a  sequence,  then  s  V  (tu:s,  oc:s)  is  a  ring  formed  by 
joining  the  last  element  of  s  to  the  first  element. 

If  s  is  a  sequence,  then  s-^  is  the  reverse  of  s.  Hence, 

rev:s  =  s-* 

oc:  s  =  iu:s~l 
uu:s  =  oc:  s-1 

A : s  =  (a:3’J)“J 

H:s  *  (A : s"1 ) 

(s^t)”1  -  t"1"s“1 

(x  cl  s)~j  =  s'1  cr  x 

(s  cr  x)"1  =  x  cl  s-1 

(x,y)_1  =  (y,x) 

<  XX,  X2,...,  xn>_1  =  <  xn,...,  x  2 ,  x:> 

/x  1  *2  •••  *n\  j.  fy  1  V2  •••  yn^ 

lyl  y2  ynJ  =  VX1  *2  •••  *„j 

If  S  is  a  sequence  and  x  Mm  S,  then  S_1:x  is  the  successor 
of  x  in  S  and  S:x  is  the  predecessor  of  x  in  S  (if  these  exist). 

S-*:x  =  successor  of  x  in  S 

S:x  =  predecessor  of  x  in  S 

These  are  convenient  ways  of  moving  around  within  a  sequence. 
Also,  note  that  if  s  is  a  subsequence  of  t  then  sCt . 

Some  additional  identities  are: 

(?)  00  -  (5) 

oc!  :  (SXT)  =  S 
uu!  :  (SXT)  *  T 
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Finally,  we  will  state  the  formal  definition  of  a  sequence: 
a  relation  is  a  sequence  if  it  is  a  connected  irreflexive  bijec- 
tion.  That  is, 


sequence 

= 

connex  A 

i  r  ref  1  A 

bun 

seirref 1 

s°  C  s~ 

1 

sSconnex 

lem:s  = 

in i t :  s  V 

r  im  :s 

A 

rim:s  = 

term:s  V 

lem  :s 

10.  Binary  Operations 
10.1  basic  concepts 

In  this  section  we  will  discuss  our  approach  to  binary 
operations  -  that  is,  to  functions  with  two  arguments  and  one 
result.  We  have  already  seen  how  unary  functions  are  connected 
to  relations.  For  instance,  we  can  write  the  fact  that  y  is  the 
sine  of  x  by  either: 


y  sin  x 


or 


y  =  sin  :x 

Since  we  only  deal  with  binary  relations,  we  will  have  to  have  a 
new  convention  for  handling  binary  functions.  This  convention 
is:  we  will  combine  the  two  arguments  of  an  operation  into  a 

pair.  For  instance,  we  can  define  a  relation  'sum'  such  that 

x  sum  (y,z) 

if  and  only  if  x  is  the  sum  of  y  and  z.  More  formally: 

sum  =  2a(a=(y,z)  A  x=y+z) 

We  can  use  our  colon  convention  as  usual,  e.g., 

x  =  sum: (y,z)  x  sum  (y,z) 

Now,  it  would  be  inconvenient  to  have  to  invent  names,  such  as 
'sum',  for  each  operation,  such  as  '+'.  Hence,  we  will  adopt  a 
systematic  convention  for  making  such  names:  either  placing  the 
conventional  infix  symbol  for  the  operation  in  parentheses  or 
underlining  the  symbol.  For  instance, 

x+(y,z)  «->  x  =  +  :  (y,z)  x  =  y+z 

In  fact,  if  it  is  any  infix  operation  symbol,  we  will  explicitly 
define  its  meaning  by 


xiry 


it:  (x,y) 


This  notation  will  permit  us  to  manipulate  in  a  more  regular 
fashion  the  usual  arithmetic  operations  (  +  ,  -,  *,  /)  as  well  as 
the  relational  operations  (e.g.  A ,  V,  X.-  {,  >,  8,  $,  '  ,')•  For 
instance,  if  S  is  a  class  of  classes,  then 


(A)  ! :SXS 


is  the  class  of  all  pairwise  intersections  of  members  of  S. 

10.2  operations  on  binary  operations 

It  is  often  convenient  to  be  able  to  generate  simple  rela¬ 
tions  from  a  binary  operation.  Following  Russell  and  Whitehead, 
let  ir  represent  any  binary  operation.  We  define: 


Hence, 


irz  =  x9(x  =  yirz) 
yir  =  x2(x  =  yirz) 


x (-1 ) y  <-»  x  =  y-1 


therefore  (-1)  is  the  predecessor  relation.  Similarly, 

x(l+)y  «-»  x  *  1+y 

therefore  (1+)  (or  (+1))  is  the  successor  relation.  These  can  be 
used  as  functions: 


(-1 ) :x  ~  x— 1 
(+1 ) : x  =  x+1 


This  convention  makes  it  very  easy  to  form  more  complex  func¬ 
tions.  For  instance,  if  we  want 


then  we  can  define 


f  :x 


f 


To  see  that  this  works: 


f :x  =  (sin (1/) ] :x 

=  sin :  [  (1/) :x  ] 
=  sin : [  1/x  ] 


sin : (1/x) 


sin ( 1/) 


Now  observe  the  action  of  the  (x,)  and  (,y)  functions: 
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Therefore,  for  any  binary  operation  it  (except  we  can  define 


ITZ 

yir 


Let's  see  why  this  works: 


(yir)  : z 


[£(y / ) ]  :  z 
it:  [  (y , )  :  z] 
it:  [y,z] 
yirz 


rr  ( ,2) 

I(y,) 


(irz)  :y  is  analogous.  In  general,  if  f  is  a  binary  function,  then 
f(x,)  and  f(,y)  are  the  "partially  instantiated"  unary  functions. 
This  is  the  effect  of  Curry  and  Feys  "B"  combinator  [5]. 

Since  S-1  is  the  reverse  of  a  sequence,  trl'  is  the  reverse 
form  of  an  operation.  For  instance,  -'  is  the  reverse  subtract 
operation: 


: (x,y)  «  -:  ( '  : (x,y)  ) 

=  (y,x) 

=  y-x 

Thus  -'  can  be  read  "subtract  from"  and  /'  can  be  read  "divide 
into".  This  is  Curry  and  Feys  "C"  combinator  (see  the  next  sec¬ 
tion)  . 


11.  Combinators 


In  this  section  we  will  discuss  several  powerful  operations 
for  manipulating  relations.  These  are  called  combinators  because 
of  their  similarity  to  the  combinators  of  Curry  and  Feys  [5]. 


The  first  combinator  we  will  discuss  is  the  paralleling  of 
relations,  *»,  which  is  defined: 

(v)(s)(y)  uRx  A  vS^ 


So,  if  f  and  g  are  functions. 


Hence,  L  is  the  element-wise  combination  of  f  and  g. 
pie,  ifJwe  want  f:(x,y)  =  sin:x  +  sin:y,  we  can  write 


For  exan- 


f 


(+) 


sin 


since 


23 


f:(x,y)  =  (+)§£§:  f*) 

=  (+)=<  <y» 

=  f  + )  .  ( sin  :x\ 

-  sin:x  +  cos:y 

One  of  the  simplest  combinators  described  by  Curry  and  Feys 
is  the  elementary  cancellator ,  K,  defined  so  that  K:x  is  a  func¬ 
tion  such  that  (K:x) :y  =  x  for  all  y.  That  is,  K  generates  con¬ 
stant  functions.  Since  K:x  is  a  relation  that  relates  x  to 
everything,  we  cm  define  it: 

K  =  (||)i 

where  i  =  is  the  unit  class  generator.  To  see  that  this 

works,  note  that 


K:x  =  ($J)i:x  =  (i:x)tl> 


and  therefore  that 

u (K : x) v  <-»  u((i:x)|$]v 

u€i:x  A  v€$  u=x 


Therefore,  (K:x):v  =  x. 

Another  combinator  is  the  elementary  duplicator,  w,  defined 
so  that 


(W:f):  x  =  f : (x , x) 


If  we  define  A;x  =  (x,x)  then  it  is  easy  to  see  that  W:f  is  just 
fA*  for  instance,  (*)A  is  the  squaring  function: 

( * ) A: n  =  (*)  s  (A:n)  _ 

=  (*) :  (n,n)  =  n*n  =  n^ 


It  should  be  clear 
j~A.  since 


that  Backus’  [f,g]  combining  form  is  just 


our 


Since  this  combination  is  so  common  we  will  adopt  a  special  nota¬ 
tion  for  it: 


Hence , 


f 

7 


A 


Some  of  the  properties  satisfied  by  these  combinators  are 
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R  T 
S’  TT 

RT 

"  sn 

1  R\  n 

_  Rn 
'  S* 

H  t 

_  RT 
"  ST 

R  Tl 
^  -oi 

_  RTJ 
~  STT) 

ii 

■Ap 

R  . 
S' 

.  s  _  , 

■  TT  * 

■  §l 

=  H 

=  R}(Rm:S) 

«■§) 

=  S} (Rm : R) 

R  _ 
'sr  - 

Fbc| 

-sarj 

cl 

■  Si'1 

cr 

-  4T1 

As  an  example  of  these  combinators  it  is  easy  to  show  that 

t  - 

is  the  function  f:t  *  t^+2t. 

Another  combinator  is  the  meta-application  operator, 
which  corresponds  to  Curry  and  Feys'  S  combinator: 

(f::g):x  =  ( f : x ) :  ( g : x ) 

For  instance,  [ (I) *] ::init  is  the  operation  that  gives  the  set  of 
descendents  of  roots  of  a  forest,  F,  since 

([{!)']: : init) : F  =  ( F”1 ! ) :  ( in i t : F)  . 

The  forma  1 i z ing  combinator ,  $,  is  defined  so  that 

($:  ( f ,a  ,b) ) : x  =  f:(a:x,b:x) 

It  is  easy  to  see  that 


For  instance. 


$:  (f  ,a  ,b)  =  f 

f  =  $:(<  +  ),  (*)A/  2*) 
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is  just  the  function  f:x  *  x2+2x.  This  can  be  written  more 
clearly  using  the  notation  of  our  relational  calculus: 

f  *  (  + 

Another  combinator  defined  by  Curry  and  Feys  is  the  combi- 
nator : 


[•£:  (f  »g)  ]  :  (x,y)  =  f:(g:x,g:y) 

This  is  simply  defined  by 


so  that 

*£  =  (I)T7^A 

Therefore,  if 

f  =  *£:((  +  ),  <*)A) 

then  f : x  =  x2+y2. 

One  final  operation  we  wish  to  define  in  this  section  is 
"Currying".  This  relates  a  relation  to  the  correponding  class  of 
pairs.  If  S  is  a  class  of  pairs,  then  CurrytS,  the  Currying  of 
S,  is  the  relation  R  such  that  xRy  if  and  only  if  (x,y)SS.  For¬ 
mally, 


Curry : S  =  a?((x,y)SS] 

The  inverse  operation,  Curry”^:R,  is  also  useful. 

Some  properties  satisfied  by  these  combinators  are: 

Curry: (SXT)  =  SIT 

(K:x) f  =  K:x 

f(K:x)  =  K:(f:x)  =  Kf:x 

A  A  =  I 


12.  Ancestral  Relations 
12.1  definition 

Carnap  [2]  defines  the  relation  of  a  property  p  being 
hereditary  with  respect  to  a  relation  r: 

p  Her  r  Vxy{x6p  A  x  r  y  -»  yGp} 

r-1 !  :p  C  p 


This  leads  to  the  definition  of  the  ancestral  of  R  of  the  first 
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kind  as  that  relation  which  preserves  all  the  hereditary  proper¬ 
ties  of  R.  This  is  also  called  the  transitive  closure  of  R: 

x  R*  y  x  Mm  BA  Vp(p  Her  R  A  x€p  yGp] 

For  example,  if  xPy  means  that  x  is  a  parent  of  y,  then  xP*y 
means  that  x  is  an  ancestor  (or  the  same  as)  y.  The  ancestra 1  of 
the  second  kind  is  also  useful: 

R+  =  R* | R 

Thus,  P+  means  "ancestor"  in  the  colloquial  sense.  The  easiest 
way  to  visualize  the  meanings  of  the  ancestrals  is  by  their 
expansion  as  infinite  unions: 

R*  =  R°  V  R1  V  R|  V  R]  V  ... 

R+  *  R1  V  R2  V  R3  V  R4  V  . . • 


Here  are  some  useful  properties  of  the  ancestrals: 

R+  =  R*-  (  =  )  *  R*-R° 

xR*y  3-n[n2.°  A  xRny] 


R°  C  R* 
Rn  C  R* 
Rn  C  R  + 

R]R+  « 
R+  C 

D+  — 


,  for  n>0 
,  for  n>0 

:  R  + 

R*  * 

RIR 

R°  V  R  + 


<r*>  -  (r:  }>* 

(R+)  1  *  (R  i)+ 


(ras) 


* 

r  as 


Ancestral  relations  are  always  transitive.  Notice  that  >  and  > 
for  integers  can  be  defined: 

>  =  (1+)* 

>  «  d  +  )  + 

The  ancestral  "fills  out"  all  of  the  paths  in  a  structure.  For 
instance,  if 


then 


Suppose  that  S  is  a  sequence  and  we  wish  to  find  the  first 
member  of  S  which  satisfies  some  property  P.  First  form  the  clo¬ 
sure  S+,  so  that  for  any  two  members  of  S+  we  can  tell  which  is 
first.  Next,  eliminate  from  S+  any  members  that  do  not  satisfy 
P:  S+XP.  Then,  oc: (S+XP)  is  the  first  member  of  S  satisfying  P. 

Next  we  will  consider  a  simple  character  manipulation  exam¬ 
ple:  ^tripping  leading  blanks  from  a  string.  Note  that 
x  (y  cl)  z  means  that  x  is  a  result  of  consing  0  or  more  y’s  on 
the  front  of  z.  Hence, 

z  [  (y  cl)*]'1  x 

means  that  z  is  the  result  of  stripping  one  or  more  y's  from  the 
front  of  x.  To  get  the  desired  result  it  is  only  necessary  to 
restrict  the  left  domain  of  this  function  to  be  sequences  that 
don't  begin  with  a  y.  Suppose  Y  is  the  property  of  beginning 
with  a  y: 

xY  «-»  y=  oc: x  <-»  y  oc  x  x  S  oc:y 
Therefore,  the  function  to  strip  leading  y's  from  a  sequence  is: 

(-oc:y)  <  [(y  cl)*1 

Before  we  leave  the  topic  of  ancestral  relations,  it  will  be 
useful  to  investigate  their  use  as  a  means  of  iteration.  Suppose 
that  F  is  a  function  (i.e.,  left  univalent).  Then,  since 

F*  -  F°  V  F1  V  F2  V  . • . 

we  will  have  yF*x  if  and  only  if  for  gome  n,  y  *  Fn:x.  In  gen- 
eral#  there  may  be  many  such  n,  so  F  may  not  be  left  univalent. 
If  F  is  to  be  a  function,  it  is  necessary  to  pick  a  termination 
condition  T  (a  clgss)  that  is  only  true  for  one  of  F°:x,  F*:x, 
Fz:x ,  . . . .  Then  T<F  is  just  the  function  sought;  it  is  roughly 
equivalent  to 


repeat  F  until  T 


13.  Arrays 

13.1  definition  and  basic  operations 

An  array  is  just  a  function  from  a  contiguous  subset  of  the 
integers  to  some  set  of  values.  If  A  is  an  array  and  i  Rm  A  then 
A:i  is  the  i-th  element  of  A.  Similarly,  if  I  C  $m:A  is  a  set  of 
index  values  then  A!:I  is  the  corresponding  set  of  array  values 
and  A}I  is  the  subarray  of  A  selected  by  those  indices. 

It  is  easy  to  define  multi-dimensional  arrays:  they  are 

just  arrays  whose  elements  are  selected  by  sequences  of  integers, 
e.g.  M:(i,j).  If  M  is  a  two-dimensional  array,  then  M(i,)  is  the 
i-th  row  of  M  and  M(,j)  is  the  j-th  column  of  M.  Also,  if  I  is  a 
set  of  row  indices  and  J  is  a  set  of  column  indices  then  M>(iyj) 
is  the  submatrix  of  M  selected  by  these  sets.  It  is  easy  to  see 
that  M'  is  the  transpose  of  M,  since 

M':(i,j)  =  M:(':(i,j>)  =  M:(j,i) 

More  generally,  if  P  is  a  permutation  function  (i.e.  a  bijection 
from  an  index  set  into  itself)  then  AP  is  the  result  of  permuting 
A  by  P. 

Suppose  x  is  an  element  of  the  array  A  (i.e.,  for  some  i, 
x=A:i).  Then  X:x  is  the  set  of  all  indices  for  which  x=A:i. 
Therefore  we  can  find  the  index  of  the  first  occurence  of  x  in  A 
(i.e.  APL's  iota  operator)  by  minXrx.  In  general,  if  P  is  some 
property  (i.e.  class),  then  A-1S:P  is  the  set  of  indices  of  all 
elements  of  A  that  satisfy  P.  A  sorted  reflexive  sequence  of 
these  indices  is  just  _<  B  (A“*!:P) 

13.2  relation  to  sequences 

It  is  easy  to  convert  arrays  to  sequences  and  vice  versa . 
Suppose  all  the  elements  of  A  are  distinct,  then  A”1  is  a- Func¬ 
tion  that  returns  the  index  of  an  element  of  A.  We  want  to 
define  a  sequence  S  such  that  xSy  if  and  only  if  x  preceeds  y  in 
A,  i.e.  the  index  of  x  is  one  less  than  the  index  of  y. 

xSy  «-»  (A"*:x)  =  (A_*:y)-1 

4-»  (A"1:x)  (-1)  (A-1 :y) 

x [A | (-1 ) | A-1] y 

Hence,  S  =  A(-1)A"*. 

Next,  we  will  consider  the  opposite  process:  converting  a 

sequence  to  an  array.  Suppose  we  have  a  sequence: 
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S 


We  wish  to  convert  this  to  an  array: 


Thus,  Cor  each  element  a^  in  the  sequence,  we  must  find  its  index 
i  in  the  resulting  array.  If  we  can  define  a  relation  R  such 
that  R:a^=i  then  R_1  will  be  the  array  we  seek.  Now  R:a^  is  just 
the  number  of  predecessors  of  a^  in  S.  That  is,  ag  has  no  prede¬ 
cessors,  so  R:a0  =  0;  a^  has  two  predecessors,  so  R:a2  *  2,  and 
so  on.  Since  S  defined  an  immediate  predecessor  relation,  S+ 
defines  an  ancestral  predecessor  relation: 


The  set  of  predecessors  of  any  element  a  is  then  S+:a,  e.g. 

S+:a2  =  (ag,  al^ 

The  size  of  this  class  is  then  the  desired  index: 

#  :  (S  +  :  a  2 )  =  2 

— 3?  — £■  _1 

Hence,  R:a  =  #:(S  :a),  so  R  =  IS  .  Now,  we  know  that  A  is  R  , 
so  we  can  define  the  function  saO  which  converts  a  sequence  into 
a  0-origin  array: 

saO : S  * 

To  produce  a  1-origin  array,  the  only  alteration  is: 

sa:S  =  ( #S  )_1 

13.3  other  array  operations 

Next  we  will  consider  the  concatenation  of  arrays.  If  A  is 
an  array  such  that  A:i  *  a^,  then  we  can  write  A: 

A  -  (a:, 1)  V  ( a  2 »  2 )  V  ...  V  (am,m) 


where  m  is  the  length  of  the  array.  Similarly,  suppose  that  B  is 
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an  n  element  array,  then  the  concatenation  of  these  arrays  is 

A  cat  B  =  (a ^ ' 1 ) V- • • V (am ,m) V (b^ ,m+l ) V. • • V (bn ,m+n) 

We  can  see  that  A  cat  B  =  AVB '  where  B’  results  from  B  by  shift 
its  indices  by  m: 

B1  =  (b^,m+l)  V  V  (bn,m+n) 

How  do  we  compute  B1?  Observe: 

xB'i  xB(i-m)  xB[(-m):i]  xB(-m)i 

Hence,  B*  =  B(-m)  and  A  cat  B  =  A  V  3(-m),  where  m  is  the  length 
of  A.  The  length  of  A  is  just  #rim:A,  so 

A  cat  B  =  A  V  B(-#rim:A) 

We  will  finish  our  discussion  of  arrays  by  investigating  the 
generation  of  sorted  arrays.  Let  S  be  a  set  of  integers  to  be 
sorted,  then  <XS  is  a  structure  which  relates  lesser  elements  to 
greater  elements.  Now  if  x  is  any  element  of  the  se.t,  (<XS):x  is 
the  set  of  all  elements  less  than  x.  Thus  [ #  (Tx3) ] : x  is  the 
number  of  elements  of  S  less  than  or  equal  to  x.  This  is  just 
the  index  of  x  in  the  sorted  array  we  seek.  Hence  if  A  is  the 
sorted  array,  xAi  if  and  only  if  i  [ # (<¥3^ ] x ,  so  A  =  [  if  (<*s)  ]  ~ 1 . 
Cf  course  this  can  be  generalized  to  any  ordering  relation. 


14 .  Scanning  Structures 
14.1  basic  concepts 


In  this  section  we  will  discuss  several  methods  for  scanning 
structures ,  that  is,  for  applying  a  function  to  each  element  of  a 
structure  and  accumulating  the  results.  Since  no  one  method  has 
yet  been  selected,  this  section  should  be  taken  as  a  report  of 
work  in  progress. 

A  general  paradigm  for  processing  a  structure,  such  as  a 
file,  is  the  following: 


1. 

Perform  some  initialization. 

2. 

Read  the  next  (or  first) 

element  of  the  file. 

3. 

Take  this  value  and  the 
values . 

results  of  processing 

the 

previous 

4. 

Process  these  to  yield  new  cumulative  values 
from  step  (2) . 

and 

continue 
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5.  When  the  end  of  the  file  is  reached,  return  the  accumulated 
result  of  processing  all  of  its  elements. 

A  simple  form  of  this  appears  in  APL's  reduction  operation: 

+/V  =  V1+(...(Vn_1+Vn) ...) 

A  more  general  form  is  Backus'  insert: 

s  f :  <x^ ,  . . .  fi^x^.^x^)*.*^ 

Our  first  example  of  scanning  structures  will  be  to  express  this 
operation  in  the  relational  calculus. 

14.2  reduction  of  arrays 

We  are  given  an  n  element  array  A  and  wish  to  compute: 

t  *  A : n  +  A:  (n-1)  +  ...  +  A:2  +  A:1 

where  we  have  assumed  that  the  right  members  of^A  are  l..n.  We 
saw  in  the  section  on  ancestrals  that  T<F*  will  iterate  the 
application  of  F  with  T  used  as  the  termination  condition.  Con¬ 
sider  how  the  analogous  loop  would  be  written  in  Pascal: 

S  :  =  0 ;  i  :  -  0 ; 
while  i/n+1  do 

begin  S  :=  S+A[i];  i  : =  i+1  end 

On  each  iteration  two  functions  are  performed:  S  is  incremented 
by  A[i]  and  i  is  incremented  by  1.  Let's  represent  the  state  of 
the  computation  by  a  pair  (s,i),  where  s  is  the  cumulative  sum  so 
far  and  i  is  the  index  of  the  next  element  to  process.  We  will 
use  F  to  represent  one  processing  step,  so  that,  if  (s',i')  is 
the  new  state,  we  can  solve  for  F  as  follows: 


(  (  +  )£:  (s,i)\ 
\(+l)ui:  (s,i )j 


Hence,  F  * 


<  +  I. 

TPTTUrj- 

(+>ff  I 

(  +  1  JIU  • 


(I) 


r.«w 
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It  remains  to  determine  the  termination  condition,  T.  If  x 
is  a  state,  i.e.,  a  pair  (s,i),  then  xST  when  i=n+l.  Hence,  xST 
when  ui:x  =  n+1,  so 

xST  ui:x  =  n+1 

(n+1)  uj  x 
x  S  uj:  (n+1 ) 

Henge,  T  =  m:  (n+1).  The  final  state,  x^,  containing  the  sum  is 
T{F  :  Xj,  where  Xj=(0,l)  is  the  initial  state: 

xf  =  (T<F*):(0,1) 

Now,  the  total  t  is  just  oc:Xf,  so 

t  =  oc(T{F*)  :  (0,1) 

We  can  generalize  this  to  any  function  f  with  initial  value  i: 
t  =  oc(T<F*)  :  (i  ,1) 


where  F 


f7T 

"  (+'T)UT 


This  result  can  be  improved  by  directly  extracting  the 
result  from  the  figal  state.  That  is,  we  want  to  define  a  filter 
&  such  that  t  =  ^F  :(i,l).  Hence  we  want  tjfXf,  so 

t£xf  t  i  (t,n+l) 

Now,  note  that  (,n+l]:t  =  (t,n+l) ,  so 

(t,n+l)  [ , n+1 ]  t 

by  the  definition  of  Therefore  jrf  *  [,n+l]-1  and  we  have  the 

simplified  formula 

t  =  (,n+l)~1F*:  ( i , 1 ) 

14.3  reduction  of  sequences 

Next  we  will  consider  the  scanning  of  sequences.  Suppose  S 
is  a  sequence: 

S  =  <s1,s2, • • • ,sn,E0F> 

where  EOF  is  an  "end  marker";  it  can  be  any  value.  Now,  we  wish 
to  find  the  result 


that  is 


1  f  s^  f  s2  £  ...  £  sn 


f:(  f :  ( . . .  £«(  i,  s  ^  )  ...),  s^ ) 
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for  some  function  f  and  starting  value  i.  The  state  can  be 
represented  by  a  pair  (t,s),  where  t  is  the  result  so  far  com¬ 
puted  and  s  is  the  rest  of  the  sequence  to  be  processed.  Hence, 
(t',s')  =  F :  ( t ,  s )  where  t'  =  f:(t,oc:s)  and  s'  =  &:s.  Therefore, 


( t  ,oc:  s  ) 
Q.:  s 


) 


(  fde:  (fc's)N) 

\  Qtu:  (t,s)/ 


Hence , 


F 


<55 

nnn 


What  is  a  terminal  state?  Notice  that  H:<sn,EOF> 
terminal  state  will  have  the  form  (r,9).  Hence, 

r  =  (,i)'1F*:  (i,S) 


9,  so  a 


To  put  this  in  a  more  useful  form,  we  will  define  a  function  f@i 
such  that  r  =  (f@i) :S.  This  is  simply 


Then,  the  sum  of  the  elements  of  a  sequence  S  is  just  (+)P0:S. 
14.4  scanning  general  structures 

It  is  often  useful  to  scan  a  structure  while  performing  some 
processing  at  each  node.  When  the  data  structure  is  a  sequence 
this  amounts  to  APL's  reduce  operator  and  Backus'  insert  opera¬ 
tor.  We  will  define  a  scanning  operation  that  works  on  a  more 
general  class  of  structures.  This  operator  can  be  understood 
intuitively  as  follows:  The  state  of  the  scanning  process  is 
represented  by  a  set  of  "read  heads"  each  of  which  is  "positioned 
over"  a  node  and  holds  state  information  accumulated  from  the 
nodes  it  has  already  visited.  A  node  can  be  processed  when  a 
read  head  has  moved  to  that  node  over  each  edge  which  leads  into 
the  node.  When  this  occurs  a  processing  function  is  applied  to 
the  node  (as  first  parameter)  and  the  union  of  the  state  informa¬ 
tion  of  each  of  the  read  heads  (as  second  parameter) .  The  result 
of  this  processing  step  becomes  the  state  information  associated 
with  a  new  set  of  read  heads  which  are  advanced  along  each  edge 
leading  out  from  the  node.  The  processing  of  the  structure  is 
completed  when  all  read  heads  have  arrived  at  terminal  nodes 
(hence  this  scanning  operation  is  not  defined  for  cyclic  struc¬ 
tures).  Scanning  a  structure  is  started  by  positioning  a  read 
head  with  initial  state  information  over  each  initial  node. 


The  scanning  operation  is  symbolized  by  f I i ,  where  f  is  the 
processing  function  and  i  is  the  initial  state  for  the  read 
heads.  For  instance,  if  V  is  a  vector,  (+)I0:V  will  scan  the 
elements  of  V  using  (+)  (i.e.  APL  +/V  or  Backus'  (/+) :V) .  For  a 


more  interesting  example,  suppose  T  is  an  attributed  parse  tree, 
E  is  a  function  that  evaluates  attributes  and  B  is  the  initial 
set  of  attribute  bindings.  Then  EIB:T  propogates  the  values  of 
inherited  attributes  down  to  the  leaves  of  the  tree.  Conversely, 
EIB:(T-^)  propogates  the  values  of  synthesized  attributes  back  to 
the  root.  Hence,  repeated  applications  of  EIB  and  (EI3) *  will 
evaluate  all  of  the  attributes.  Of  course,  this  program  will 
work  just  as  well  if  T  is  a  forest  of  parse  trees.  The  I  opera¬ 
tor  is  still  undergoing  evaluation  as  it  is  one  of  several  possi¬ 
ble  structure-directed  scanning  operations. 


15.  Examples 


In  this  section  we  will  give  several  examples  of  relational 
programs . 

15.1  payroll 

Suppose  we  have  a  file  $  of  employee  records,  where  r  =  $:n 
is  the  record  for  the  employee  with  the  employee  number  n.  We 
will  suppose  that  employee  records  are  functions  defined  so  that: 

r:N  =  employee  name 

r:H  =  hours  worked  so  far  this  week 
r:R  =  pay  rate 

We  are  given  an  update  file  U  such  that  U:n  is  the  number  of 
hours  worked  by  employee  n  today.  We  wish  to  generate  a  new  pay¬ 
roll  file  $ ' . 

SOLUTION:  Let  r  =  $:n  and  r'  =  $':n  be  the  old  and  new 
employee  records.  It  is  clear  that  r'  is  the  same  as  r  except 
for  its  H  field.  In  order  to  modify  part  of  a  relation,  we  will 
use  the  Md  function  defined  by: 

Md:  (S , R)  =  R  V  S>  (-Rm : R) 

Then,  if  h'  represents  the  new  value  of  the  H  field,  the  new 
employee  record  is 


r'  =  Md  :  ( r ,  (h’,H)) 

where  h'  is  just  the  cumulative  hours  worked: 

h'  =  ($:n) :H  +  U:n 

Therefore,  by  the  definition  of 

$'  :n  =*  r  '  =  Md  :  (  $  :n ,  (h '  , H)  ) 


To  find  $'  we  must  factor  out  the  employee  number  n.  To  do  this, 
note  that  ($:n):H  =  (:H):($:n)  *  (:H)$:n.  That  is,  (:H)$  is  a 


slice  of  the  payroll  file:  the  hours  worked  for  each  employee. 
Therefore , 


h '  =  ( $  :  n )  :  H  +  U :  n  =  ( :  H )  $  :  n  +  U :  n 

-  (.)liBli|=„ 

Now,  define  the  updating  function  u  by 
u:n  =  (  (  +  )±l£21j  :  n ,  H  ) 

=  (,H)  (+)llgl4|  :n 

Then,  $':n  =  Md  :  (<$  :n  ,u  :n)  =  Md^:n  Therfore,  the  solution  to 

our  problem,  the  new  payroll  file,  is 

$  where^u  =  (,H)  (  +  )  tsgii| 

15.2  check  issueing 

Suppose  we  wish  to  take  the  payroll  file  from  the  previous 
example  and  generate  checks  for  the  employees.  We  will  assume 
that  a  function  C  is  available  such  that  C: (nm,p)  returns  a  check 
in  the  amount  p  made  out  to  the  name  nm. 


SOLUTION:  We  will  ignore  overtime  computations.  Hence,  if  n 
is  an  employee  number  then  $:n:N  is  his  name  and 

p:n  =  $ : n : H  *  $ :  n  :  R 

is  his  pay.  Hence,  his  check  c:n  is  c:n  =  C:(nm,p:n)  =  C :  ^.nJ1  j 

=  C:(^N)$:nj  =  Ci^li-|:n 

Combining  these  we  have  the  file  F  mapping  employee  numbers  into 
checks : 

f  =  c±i£i! 


m 


from  which  we  can  factor  out  the  old  payroll  file: 


F  =  C 


_lN 


m 


i 


If  we  just  want  a  set  of  checks,  this  is  Lm:F. 


16.  Implementation  Notes 

The  primary  goal  of  our  investigation  has  been  to  determine 
if  relational  programming  is  significantly  better  than 
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conventional  methods.  It  would  be  premature  to  devote  much 
effort  to  implementation  studies  before  it  is  even  determined  if 
relational  programming  is  an  effective  programming  methodology. 
However,  a  brief  discussion  of  implementation  possibilities  is 
probably  not  out  of  line. 

The  most  obvious  representation  of  a  relation  is  the  exten¬ 
sional  representation,  in  which  all  the  elements  of  a  relation  or 
class  are  explicitly  represented  in  memory.  There  are  many  kinds 
of  extensional  representations,  such  as  hash  tables,  binary  trees 
and  simple  sorted  tables.  Of  course,  performance  can  be  improved 
through  the  use  of  associative  memories  and  active  memories  (in 
which  each  memory  cell  has  a  limited  processing  capability). 

Some  relations  and  classes  will  be  so  large  that  it  is 
uneconomical  to  represent  them  explicitly  in  memory.  In  these 
cases  an  intensional  representation  [11]  should  be  used.  Here  a 
class  or  relation  is  represented  by  a  formula  or  expression  for 
computing  that  relation  or  class.  Operations  on  the  class  or 
relation  are  implemented  as  formal  operations  on  the  expression. 
This  is  feasible  because  of  the  simple  algebraic  properties 
satisfied  by  relations.  It  can  be  seen  that  an  intensional 
representation  is  really  just  a  variant  of  a  lazy  evaluation 
mechanism  [9,  10].  Sometimes  an  intensional  representation  is 
necessary;  for  instance,  relations  of  infinite  cardinality,  such 
as  the  numerical  operators  and  relations,  require  an  intensional 
representation. 

Although  the  programmer  could  be  allowed  to  choose  between 
extensional  and  intensional  representations  for  his  relations, 
this  is  not  necessary.  It  is  probably  feasible,  and  certainly 
higher  level,  to  have  the  system  choose  representations  on  the 
basis  of  cardinality  estimates  of  the  classes  and  relations 
involved.  The  algebra  of  relations  is  regular  enough  that  many 
of  these  decisions  can  be  made  at  compile  time.  Any  that  can't 
can  be  deferred  to  run-time  when  exact  cardinality  information  is 
available.  See  [14]  for  related  techniques. 


17.  Conclusions 

Of  course,  we  are  not  the  first  to  propose  introducing 
aspects  of  a  relational  calculus  into  programming.  Codd  [4]  has 
used  a  relational  calculus  as  the  basis  for  data  base  systems. 
Although  he  defines  several  operations  on  relations  (viz.,  premu¬ 
tation,  join,  tie,  composition,  and  restriction),  this  small  set 
of  operations  is  insufficient  for  general  purpose  programming. 
These  remarks  also  apply  to  Childs'  reconstituted  definition  of 
relations  [3],  which  are  also  oriented  towards  data  bases.  Feld¬ 
man  and  Rovner  [6]  augmented  Algol  with  several  relational  opera¬ 
tors  for  associative  access  to  a  data  base.  Their  operations, 
which  are  our  plural  description  and  image,  are  quite  limited, 
being  based  on  a  traditional  von  Neumann  language. 
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One  general  purpose  language  that  does  make  extensive  use  of 
sets  and  relations  is  SETL  [7] ,  which  provides  most  of  the  fami¬ 
liar  operations  on  sets  (e.g.,  union,  intersection,  difference, 
powerset,  image).  SETL  differs  from  relational  programming  in 
three  significant  respects:  (1)  it  can  only  handle  finite  sets , 
(2)  many  operations  must  still  be  performed  in  a  word-at-a-time 
fashion  using  the  set  former ,  and  (3)  it  resorts  to  conventional 
control  structures. 

Finally,  we  must  mention  "logic  programming"  systems,  such 
as  PROLOG  [15,  8],  which  use  predicate  logic  to  describe  computa¬ 
tional  processes.  These  systems  also  differ  from  relational  pro¬ 
gramming  in  several  significant  respects:  (1)  they  have  a  word- 
at-a-time  programming  style  due  to  the  use  of  variables 
representing  individuals  in  the  clauses  of  the  program,  and  (2) 
they  are  implemented  using  a  resolution  theorem  prover,  whereas  a 
more  conventional  procedural  implementation  suffices  for  rela¬ 
tional  programming.  Essentially  the  same  remarks  apply  to 
Popplestone ' s  relational  programming  [13],  which  is  like  logic 
programming  except  that  it  uses  "forward  inference"  rather  than 
"backward  inference". 

In  summary,  no  other  programming  style  that  we  are  aware  of 
combines  the  universal  use  of  relations  with  a  rich  set  of  opera¬ 
tions  on  those  relations  that  can  be  implemented  in  a  determinis¬ 
tic,  procedural  way.  It  is  hoped  that  the  preceeding  discussion 
has  made  plausible  some  of  the  advantages  claimed  for  relational 
programming  in  the  Introduction.  Considerable  work  remains  to  be 
done  in  evaluating  the  effectiveness  of  a  relational  calculus  as 
a  programming  tool.  For  instance,  the  optimum  set  of  combinators 
and  relational  operators  must  be  selected.  Another  non-trivial 
problem  is  the  selection  of  a  good  notation  for  the  relational 
calculus.  More  from  convenience  than  conviction  we  have  used  the 
notation  of  [16]  and  [2].  Making  relational  programming  an 
effective  tool  will  require  designing  a  notation  that  combines 
readability  with  the  manipulative  advantages  of  a  two-dimensional 
algebraic  notation.  This  is  all  preliminary  to  any  serious  con¬ 
siderations  of  software  or  hardware  implementation  techniques. 
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